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IMPROVED BOUNDS FOR ARITHMETIC PROGRESSIONS IN 

PRODUCT SETS 

DMITRII ZHELEZOV 


Abstract. Let B be a set of natural numbers of size n. We prove that the 
length of the longest arithmetic progression contained in the product set B.B = 
{bb'lb^b' £ B} cannot be greater than 0(n log n) which matches the lower bound 
provided in an earlier paper up to a multiplicative constant. For sets of complex 
numbers we improve the bound to O e (n 1+e ) for arbitrary e > 0 assuming the 
GRH. 


1. Introduction 

In [8j we have studied a relationship between the additive structure and the size 
of a product set in the following sense. It was shown that a product set B.B = 
{bb'\b,b' e B} of a set of natural numbers B of size n cannot contain arithmetic 
progressions longer than In the present note want to improve this 

result and show that a better bound actually holds. 

Theorem 1.1. Suppose that B is a set of n natural numbers. Then the longest 
arithmetic progression in B.B has length at most 0(n log n). 

In the same paper [S] it was shown that there is a set B of n natural numbers 
such that B.B contains an arithmetic progression of length 12(nlogn), so the 
bound of Theorem 11.11 is tight up to a multiplicative constant. 

In the second part of the paper we generalize Theorem 11.11 to get a slightly 
weaker bound for complex numbers. Unfortunately, we need certain sharp esti¬ 
mates (namely, an effective version of the Chebotarev density theorem) so the 
result is conditional on the Generalized Riemann Hypothesis (GRH). 

Theorem 1.2. Suppose that B is a set of n complex numbers. Then the longest 
arithmetic progression in B.B has length at most O t (n 1+e ) for any e > 0, assuming 

GRH. 
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So the problem to unconditionally improve the bound 0(n 3 / 2 ) obtained in [8] for 
sets of complex numbers remains open. It is interesting to note that the polynomial 
congruences arising in the proof of Theorem II.21 were studied by Bourgain, Garaev, 
Konyagin and Shparlinski [3] in a slightly different context. 

Remark. It is possible to track down the explicit dependency on e, so in fact 
Theorem 11.21 holds with the absolute bound 

( 1 I loglogri 

n 

2. Notation 

following notation will be used in this paper: 
f(n) = 0(g(n)) means that lirnsup n ^ (X) ^ < oo. 
f(n) = Q(g(n)) means that g(n) = 0(f(n)). 

X <C Y. resp. X Y means X = 0(Y), resp. Y = O(X). If not stated 
explicitly, when using such notation, we assume that X and Y depend on 
a large parameter n, which is normally the size of the set in question. For 
example, for two sets A and B, |7l| \B\ means that |R|/|A| is bounded 

from above by a constant independent of the sizes of A and B. 

When dealing with arithmetic progressions, we will often write {a + d[IV]} 
as a shortcut for the set 

{a + di ; i — 0,..., N — 1}. 

Note, that N is then the size of the set. Likewise, {c(a + d[iV])} is just 
{ca + cd[N]} and for an arbitrary function /(•), 

urn )} = I/O; i = 0 ,...,JV-l}. 

(5) Let H be a fixed graph. Then ex(n, H) denotes the maximal number of 
edges among all graphs with n vertices which does not contain H as a 
subgraph. In particular, ex(n, C*,) denotes the maximal number of edges a 
graph with n vertices which avoids cycles of length k. 

(6) Let p be a prime, then d = ord p (n) denotes the maximal power of p such 
that p d | n. 

(7) For a natural number n, uj{n) denotes the number of distinct prime factors 
of n. u(n, k ) counts only the prime factors p > k. 

3. Integer arithmetic progressions 

Let A be a set of natural numbers of size N. One of the obstructions which 
prevents A from being a subset of a product set of considerably smaller size (say of 
order \/~N) is a large number of distinct prime factors of its elements. In particular, 
this is the case if A is an arithmetic progression and if the size of the elements of 


The 

( 1 ) 

( 2 ) 
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A is bounded in terms of N. If A is an arithmetic progression, it is convenient to 
assume that A is of the form 

A={D(r + d[N})}, ( 1 ) 

with gcd(.Dr, d) = 1, which one can do without loss of generality (by Lemma 1 of 
IB])- The exact claim is then as follows (see Lemma 4 of [S]) . 

Lemma 3.1. Let gcd (Dr,d) = 1 and A = {D(r + d[iV])} be an arithmetic pro¬ 
gression contained in a product set B.B with \B\ — n. If d < N k , r < N k , 
k = o(loglV) then N < 36fcnlogn for sufficiently large n. 

We are going to prove that either the hypothesis of Lemma 13.11 is fulfilled or 
there are many distinct prime divisors of the elements of A. 

The following lemma is a slight modification of an inequality due to Erdos as 
cited in [[5J. The proof is elementary yet illuminating. 

Lemma 3.2. Let gcd (d,r) = 1 and II = r(r + d)... (r + (N — l)d). For every 
prime factor p of II, leave out a term r + id with the maximal ord p , that is, for 
which p m \r + id and p m+1 divides none of the elements (resolving ties arbitrarily 
and obviously a term is removed at most once). Let M be the total number of 
elements that are left out. Then 

r(r + d)...(r + (N - 1 - M)d) < (N - 1)!. (2) 

Proof. After the elimination process described in the statement of the lemma is 
finished, the remaining terms have only prime factors less than N coprime with 
d. The order of such a prime p is at most 


that is, the power of p in (N — 1)!. Therefore, the product of the remaining terms 
divides (N — 1)!. The claim of the lemma then follows as the left hand side is 
certainly less than or equal to the product of the remaining terms. 

□ 

Corollary 3.1. Either both d,r < N 2 or M > N/2 assuming N is sufficiently 
large. 

Proof. Follows from ([2]). □ 

Before we proceed with the proof, let us define an auxiliary bipartite graph 
G(A, B.B ) which in general can be constructed for any sets A and B whenever 
A C B.B. The color classes of G are just two copies of B and for each a G A 
we pick a unique representation a = 6162 (for example, take the lexicographically 
smallest pair) and place an edge ( 61 , 62 ) in G. Record that V(G) = 2\B\ and 
E{G) = \A\. We will call this graph the containment graph of A in B.B. 
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Proof, [of Theorem 11.1] 

By Corollary 13.11 and Lemma 13.11 it suffices to handle the case M > N/2. 
Let pi, be the primes and Gp, the corresponding elements of A/D = 

{r + d[lV]}, left out by the process described in Lemma [3721 applied to the product 

N -1 

n = II ° = IK' + *0- 

a&A/D 3=0 

First, we prune from the set A' = {op,..., cim} all elements a* with pi < N/2. 
By the Prime Number Theorem, o(N) elements are pruned, so with a slight abuse 
of notation we may assume that M > N/3 and pt > N/2. Then, each pi divides 
at most two elements, so we can pick a subset of primes P = {pi} and a subset 
A" C A' with | A"\ > N/6 , such that each p e P divides exactly one element in A". 
Note, that A" is as well a subset of A/D , and what we are going to show is that 
DA" C B.B implies \B\ = Q(N). Clearly, this implies the claim of the theorem. 

Let G(DA ", B.B ) be the containment graph of DA" in B.B. We claim that G 
contains no cycles. Suppose not and C = (&i....& 2m ) is a cycle in G (C is even 
since G is bipartite). By construction, there is a prime p which divides b^bi/D 
and does not divide bibi + \/D for i > 1 (indices are taken modulo m). In other 
words, ord p (&i& 2 ) > ord p (.D) and ord p (&i 6 j+i) = ord P (D) for i > 1. On the other 
hand one can write 

J [ bib; . j = bib; . I . 

i is even i is odd 

which leads to a contradiction, as the order of p on the left hand side is greater 
than on the right hand side. Thus, \A"\ = E(G) < V(G) — 2\B\ and we are done. 

□ 


Following the same route, we can state the following general proposition. 


Proposition 3.1. Let A be a set of integers of size N such that for every a e A 
holds the bound a/ gcd(kl) < N Kl , where gcd(kl) denotes the GCD of all the 
elements in A. Suppose that A e B.B for some set B and 


u 



a 

gcd(A) 



> M. 


Then \B\ — . 


Proof. Because of the bound a/ gcd(kl) < N hl each element of {a/ gcd(zl) : a G 
A} can have at most 0(1) distinct prime factors which are greater than K 2 N. 
Thus, we can choose 0(M) primes P = {p;} and A' = {a*} such that p, > K 2 N 
and each p * divides exactly one element of A'/ gcd(Ll). Applying verbatim the 
argument of the proof of Theorem 11.11 we then conclude that \B\ = f2(|A'|) = 
D Ki ,k 2 {M) □ 
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4. The case of complex numbers 

We now switch to the complex case. In [8] it was shown that if B is a set 
of complex numbers and B.B contains an AP A with |A| = Q (|L?| 3 / 2 ), one can 
construct a set B' of integers such that B'.B' contains an AP of the same length. 
Though we don’t see how this argument extends to shorter sequences, it is still 
possible to deduce some structural information from the fact that the product set 
contains a long AP. 

As we are dealing with complex numbers now, it is safe to assume that A = 
{r+[A^]}, rescaling each element of B by a square root of the difference if necessary. 
Let G(A , B.B ) be the containment graph of A in B.B. Then, if we consider r as a 
variable, any cycle in G implies a polynomial relation in r with integer coefficients, 
so in fact r must be algebraic, and the degree is controlled by the cycle length. 

Before we proceed, recall that the height of an integer polynomial P is defined 
as the largest (by absolute value) coefficient of P. It will also convenient for our 
purposes to define the height of an algebraic number a as the height of its primitive 
polynomial, that is, the integer polynomial of minimal degree and minimal in 
absolute value leading coefficient, of which a is a root. 

Lemma 4.1. If G(A, B.B) contains a 2k cycle then r is an algebraic number of 
degree at most k and height at most 0(N k ). 

Proof. Let ( b 0 b\... & 2 fc-i) be a cycle in G. Then we have 

]j[ bibi+1 = &A+1, 

i is even i is odd 

or after the substitution &A+i = aj p) = r + j for some j (i) we have a polynomial 
equation in r with integer coefficients 

n ( r +iW)= n ( r+ d( i ))- 

i is even i is odd 

The leading term cancels out and we arrive at an integer polynomial of degree less 
than k. The polynomial is not identically zero since the j s are distinct. Moreover, 
the coefficients of the polynomial are bounded by 0(N k ). Indeed, for 0 < l < k 
the coefficient in the term r l is a sum of {k — Z)-fold products of j s, and each j is 
in turn bounded by N. The number of summands is bounded by 2(j), thus each 
coefficient is bounded by 0(N k ) as 2k < V(G) < N. □ 

Let K = Q(r) be the held extension with r adjoined. The following lemma, 
which is a straightforward generalization of [8] Lemma 7, allows us to assume that 
in fact all the elements we are dealing with lie in K. 

Lemma 4.2. Let A C B.B and A C K for some field K. Then there is B' with 
\B'\ < 2\B\ s.t. B' C K and A C B'B'. 
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We will need some estimates from number theory to estimate the number 7r(P, x ) 
of primes p < x such that P(-) = 0 has a solution modulo p, where P G Z[x], 
that is, P has a linear factor in F p . ffere we record a lemma we will use, which is 
a consequence of a certain effective version of the Chebotarev Density Theorem. 
The original statement, proved by Lagarias and Odlyzko, is somewhat technical, 
so we defer it to the Appendix. 

Lemma 4.3. Assume P is an irreducible integer polynomial of degree d with 
coefficients bounded by 0(N k ). Then, assuming GRH, 

» ( P.JV) = tU.(^L). (3) 

To close the argument, we now prove the size of B is actually controlled by the 
degree and height of r, which would imply that either the containment graph does 
not contain small even cycles and thus sparse, or B is not very large. 

Lemma 4.4. Suppose A = {r + [N]} C B.B. Let k be fixed and r is an al¬ 
gebraic number of degree k and height 0(N k ). Then, assuming GRH, \B\ = 
n d , k (N/\ogN). 

Proof. Let K = Q(r)/Q, so k — [Q(r) : Q], By Lemma 14.21 there is a set B' of 
algebraic numbers in K of degree at most k such that B'.B' contains {r + [TV]} 
and \B\ 3> \B'\. 

First, we want to make all elements of B' algebraic integers. Let M be the LCM 
of the denominators of bs in B' (by the denominator of an algebraic number a we 
mean the minimal positive integer D s.t. Da is integer in Q(r)). Multiplying 
every element by M we get a new set B' of algebraic integers (we use the same 
letter to shorten the notation) such that B'.B' contains 

A= {M 2 (r + [N])}. 

Next, since r is of height 0(N k ), the denominator of r is bounded by 0(N k ), 
so there is a rational integer m = 0(N k ) s.t. mr is an integer in Q(r) of height 
0(N k2 ). Denote r' = rm and take B 2 = mB' U B' , so 

A 2 = {M 2 (r' + m[N])} C B 2 .B 2 . 

Now we have constructed a new set B 2 of algebraic integers such that B 2 .B 2 
contains an AP 

A 2 = {M 2 (r' + m[N})} 

with m = 0(N k ) and r' is an algebraic integer of height 0{N k ~). 

We claim that the norm A'Qp.pQ^r' + mi) is an integer polynomial in i of degree 
k and coefficients bounded by 0(N k2 ). Indeed, let aj,j = 1... k be the distinct 
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C-embeddings of Q(r). We have 

k 

N(r' + mi) = m k N(r + i) = m k + i) — (—1 ) k P(—i)m k , 

j =i 

where P is the minimal polynomial of r. On the other hand, by our choice of m, 
m k P{r) is the minimal polynomial of mr (since it is an integer monic polynomial 
with respect to mr of degree k) which in turn has integer coefficients bounded by 
0(N k2 ). 

The final step is to take norms. Let 

B 3 = {N(b ) I b e b 2 } 


and 

^3 = {iV(a) | a G A 2 }. 

Now B 3 and A 3 consist of rational integers and 

A 3 = {M 2d f ([N])} C B 3 .B 3 , 

where / is a polynomial of degree k and height 0(lV fc2 ). We then apply Lemma 
14.31 to deduce that there are at least CkN/ log N distinct primes p < N such that 
/ has a root modulo p. In other words, if p < N is such a prime then there is an 
index i s.t. f(i ) = 0 mod p. By the Prime Number Theorem, at least half of such 
primes are greater than CkN/A for large N, so 



> 


C k N 

2 log AT 


(4) 


On the other hand, f(i) = Ok(N k " +1 ) since the coefficients are bounded by 
0(N k 2 ) and the degree of / is k. In particular, certainly gcd(/[iV]) = Ok(N k " +1 ) 
which together with (j3J) gives 


co 



a 

gcd(A 3 ) 



= co 



m L.O 

gcd(/[iV])’ 4 J 



Now everything is ready to apply Proposition 13.11 to the containment A 3 C 
B 3 .B 3 and conclude that \B 3 \ = O*,(IV/logIV). Since |5 3 | <s; \B\, the claim 
follows. 

□ 


It remains to apply a well known result which bounds the number of edges in a 
2A;-cycles free graph. 


Proof, [of Theorem 11.2] 
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Fix e > 0 and let k — [ 7 ] • If G(A,B.B ) does not contain 2fc-cycles, then by 
the bound of Bondy and Simonovits [ 2 ], 


\A\=E(G) = O k (\B\ 1+1 ' k ), 


so we are done. 

Otherwise, by Lemma 14.11 r is an algebraic number of degree at most k 
height at most 0(N k ). Then Lemmas 14.11 and ITT imply |A| = O fc (|i?| 1+0 ( 1 )). 


and 

□ 
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Appendix 

Let A be a Dedekind domain, K its quotient field, L a finite extension of K with 
GalfL/ K) = G. Let also p be a prime ideal in A that does not ramify in L and 03 
be an ideal of B = Op lying over p. Recall that if L/K is abelian, the Frobenius 
element (see e.g. [S]) is a unique element a v G G such that for all a G Op holds 

cr p (a) = a*° K ^ (mod p O l ). 
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For a non-Abelian extension L/K, the Frobenius symbol depends on the choice 
53 lying over p, and we have for all a £ Ol 

a<s(a) = a*° K ^ (mod 53). 

In this case all such elements a® are conjugated, and the Frobenius symbol 
[^y-\ is defined as the corresponding conjugacy class of G. 

The crucial property of the Frobenius symbol is that it determines the decompo¬ 
sition type of p in Ol- hi particular, if L is the splitting held of a polynomial / and 
p does not ramify (i.e. does not divide the discriminant of /), the cycle structure 
of cr p , viewed as a permutation of the roots of /, corresponds to the degrees of the 
irreducible factors of / modulo p in (Ok/p)[X\- hi particular, / has a root mod 
p if and only if cr p fixes at least one root, which corresponds to a linear factor. 
Thus, to estimate the number of such primes, it suffices to estimate the number 
of prime ideals p such that the corresponding conjugacy class contains at least 
one 1-cycle. For simplicity, we can take the conjugacy class containing only the 
identical permutation, i.e. take only prime ideals such that / factors completely 
into linear factors mod p. The desired estimate is then given by the Chebotarev 
Density Theorem, and for our purposes we need the quantitative effective version 
proved by Lagarias and Odlyzsko under GRH. 


Theorem 5.1 (Lagarias and Odlyzko, [7]). Let L/K be a Galois extension of 
number fields, df the absolute discriminant of L, ul the degree of L over Q. Let 
G be the Galois group of this extension and C is a conjugacy class of G. For all 
x > 1, let ttc ( x ) denote the number of prime ideals p of K of norm less than or 


equal to x, that do not ramify in L and such that 
GRH, 


L/K 


= C. Then , 


assuming 


\nc( x ) ~ ]SrLi(a;)| < c (j^-y/x\og(d L x lGl ) + \C\ log {d L 


\G[ 

where c is an absolute constant. 


\G\ 


To deduce Lemma 14.31 it remains to take L the splitting held of our polynomial 
P, C = {id} (which is in fact is more restrictive) and note that the discriminant is 
bounded by N° d F 1 \ while the order of the Galois group is bounded by d\ = 0^(1). 
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